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ON LIPSCHITZ EXTENSION FROM FINITE SUBSETS 


ASSAF NAOR AND YUVAL RABANI 


Abstract. We prove that for every n £ N there exists a metric space (X,dx), an n-point subset 
S C X, a Banach space (Z, || • || z) and a 1-Lipschitz function / : S —> Z such that the Lipschitz 
constant of every function F : X -A Z that extends / is at least a constant multiple of y/logn. This 
improves a bound of Johnson and Lindenstrauss IJL84I . We also obtain the following quantitative 
counterpart to a classical extension theorem of Minty [Min70| . For every a £ (1/2,1] and n £ N 
there exists a metric space (X,dx), an n-point subset S C X and a function / : S —¥ ^2 that is 
o-Holder with constant 1, yet the a-Holder constant of any F : X —» £2 that extends / satisfies 


ll*1lLip(a) £ (log n ) 


+ 


/ log n 
V log log n 


We formulate a conjecture whose positive solution would strengthen Ball’s nonlinear Maurey exten¬ 
sion theorem IBa.1921 , serving as a far-reaching nonlinear version of a theorem of Konig, Retherford 
and Tomczak-Jaegermann IKRTJ801 . We explain how this conjecture would imply as special cases 
answers to longstanding open questions of Johnson and Lindenstrauss |JL84| and Kalton |Kal04j . 


1 . Introduction 

Given two metric spaces (X,dx) and (Z,dz), the Lipschitz constant of a mapping f : X Z 
will be denoted below by ||/||Lip- For every subset SCI, let e(X,S,Z) denote the infimum over 
those K £ (0, 00] with the property that for every / : S -A Z there exists F : X -A Z with 
11E11Lip ^ K\\f\\ Lip and whose restriction to S satisfies F\s = /. In its most general form, the 
Lipschitz extension problem asks for estimates on the quantity e(X, S, Z). 

It is of great interest to obtain geometric conditions on the metric spaces (X,dx) and ( Z,dz ) 
ensuring that e(X, S, Z) < 00 for every S C X. Formally, define e(X, Z) to be the supremum 
of e(X,S,Y) over all S C X. When e(X,Z) = 00, it is natural to refine the Lipschitz extension 
problem by taking nCN and defining e n (X, Z) to be the supremum of e(X, S, Z) over all SCI 
of cardinality at most n, and asking for the asymptotic behavior of e n (X,Z) as n A 00. Another 
natural quantitative refinement of the parameter e(X, Z) is to fix e € (0,1] and define e £ (X, Z ) to be 
the supremum of e(X, S , Z) over all S C X that are e-discrete in the sense that dx(x , y) ^ e-diam(5) 
for every distinct x,y £ 5, and asking for the asymptotic behavior of e £ (X, Z) as e A 0. 

Denote the supremum of e n (X, Z) over all metric spaces (X, dx) and all Banach spaces (Z, || • \\z) 
by oe(n). Here “ac” stands for “absolute extendability,” where we are following the notation and 
terminology that was introduced in [LN04j . Thus, if uc(n) < K then for every n-point metric space 
(. M,dM ), every Banach-space valued 1-Lipschitz function defined on M can be extended to any 
metric space that (isometrically) contains (M, dM) so that the Lipschitz constant of the extended 
function is at most K. One can similarly define for e > 0 the quantity oc(e) by considering the 
supremum of e £ (X,Z) over all metric spaces (X,dx) and all Banach spaces ( Z , || • \\z). 
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The finiteness of ac(n) and ac(e) for every n G N and e € (0, 1] is well-known. Johnson, Lin- 
denstrauss and Schechtman proved in |JLS86] that ae(ro) < logn, where here, and in what fol¬ 
lows, the notation A < B (respectively A > B) stands for A ^ CB (respectively A ^ CB) for 
some universal constant C € (0, oo). We shall also use the notation A x B when A < B and 
A > B. The above upper bound of [JLS86] on ac(n) was more recently improved [LN041ILN05] 
to ae(ro) < (logn)/log logn, which is the best-known upper bound on ac(n) to date. Determin¬ 
ing the asymptotic behavior of oc(n) as n ^ oo remains a major open problem. The previously 
best-known lower bound on ac(n) is ae(n) > y/ (log n) / log log n, obtained over thirty years ago by 
Johnson and Lindenstrauss |JL84| (see also |MM10j for a variant of the construction of Johnson and 
Lindenstrauss that also yields the same lower bound). Here we obtain the following improvement. 

Theorem 1. For every n € N we have ac(n) > y/\ogn. 


Thus, the best-known bounds for ae(n ) are now 


\Aogn < ac(n) < 


log n 
log log n 


(1) 


It would be very interesting to improve any of the bounds in (HD, the ultimate goal being to 
determine the rate at which ac(n) tends to oo as n — > oo. 

In [JL84 ] it was shown that ac(e) < 1/e, and a different and very simple proof of this fact was 
given in |JLS86| . In |MN13a| it was proved that ac(e) > 1/e, via a quantitative refinement of a 
beautiful argument of Kalton |Kall2| . Thus ae(e) x 1/e. The example that leads to Theorem [1] also 
yields a new proof that ac(e) > 1/e which some may find to be somewhat simpler than the proof 
in |MN13a| : see Remark [HI below. We note, however, that the example of jMN13a| has important 
properties (see the discussion in Section 1.6 of IMN13a| ) that Theorem Q] does not imply. 

The value of Theorem [Tj does not stem only from the fact that it yields an asymptotic improve¬ 
ment over the best known bound, but rather because this improvement relies on a conceptually 
different approach than that of |JL84j . In fact, as emphasized explicitly in [JL84] . the difficulty to 
improve the lower bound on ac(n) to a constant multiple of y/\ogn is not purely technical, and the 
approach of (JL84 j inherently cannot yield a lower bound that is better than o(vdogn). 

To explain the above assertions, we need to briefly sketch the argument of [JL84] . Fix k € N, 
L > 0 and e € (0,1/2). Suppose that ( X , || • ||x) is a Banach space and Y C X is a fc-dinrensional 
linear subspace of X. Let lsf £ be an e-net in the unit sphere of Y and denote S e = N e U {0}. 
Taking / : S £ —>■ S £ C Y to be the identity mapping, it is proved in [JL84J that if F : X —>• Y 
is an L-Lipschitz extension of / and e is sufficiently small then there exists a linear projection 
P : X —» Y with ||P|| < L. Hence, if for some A > 0 one knows that ||P|| ^ A for every linear 
projection P : X —>• Y then one concludes that L > A. Suppose that A > y/k; since the classical 


Kadec'-Snobar theorem |KS7l| (see also |Woj91, §III.B]) asserts that we always have A ^ y/k, 


this lower bound on A is the best one could hope for. By standard bounds on the size of e-nets 
(e.g. [MS86| ). if we set n = |S e | then logn x ATog(l/e), so we have L > yj (logn)/log(l/e). For 
this strategy to yield a lower bound that is a constant multiple of yTogn, one needs to take e to be 
a universal constant, but in this case we would have L < ae(2e) < 1/e = 0(1). So, in order to get 
a lower bound on L that tends to oo with n one must have e = o(l) as k —» oo (indeed, in |JL84| 
the choice of e is log(l/e) x log A: x log logn). This explains why one cannot prove Theorem [1] via 
the above strategy of reduction to the nonexistence of linear projections of small norm. Note also 
that due to the Kadec'-Snobar theorem [KS71j . this “linearization” approach cannot yield a lower 
bound on ac(n) that tends to oo at a rate that is faster than a constant multiple of y/Togn. 


Here we overcome the above obstacle by abandoning entirely the desire to reduce the problem 
to the nonexistence of linear projections of small norm. We consider a family of metric spaces that 
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arise from (a modification of) expander graphs, and our poorly extendable functions take values 
in their associated Wasserstein 1 spaces. The proof relies on direct geometric considerations, in 
particular the use of Banach space-valued Poincare inequalities (see Lennna[8] below), rather than a 
reduction to linear mappings. Our approach is inspired by a certain algorithmic clustering problem 
for graphs, specifically by leveraging the difference between two relaxations of the corresponding 
combinatorial optimization problem. This motivation is explained in Section [2] below, though the 
proof of Theorem Q] (in Section [3]) is direct and does not rely on any algorithmic background. 


1.1. Hilbert space-valued Holder functions. Suppose that (X,dx) and ( Z,dz ) are metric 
spaces and a € (0,1]. We shall denote below the a-Holder constant of a mapping / : X —>• Z by 
II / llupfo)• Thus ||/||Li P (a) is the infimum over those L € (0, oo] for which every x,y € X satisfy 
dz{f{x), f(y)) ^ Ldx(x,y) a . Equivalently, ||/||Li P (a) is the Lipschitz constant of / when it is 
viewed as a mapping between from the metric space (X,d^) to the metric space ( Z,dz )• Denote 


e a (X,Z) d = e((X,d%),(Z,d z )) and VnG N, 


e n(X, Z) = e n ((X,dx), {Z,dz))- 


Thus, e a (X, Z) is the infimum over those K € [1, oo] such that for every 5CI and every / : S —» Z 
there exists F : X —>• Z with F\g = f and ||T)| Lip ( Q ) ^ AT||/|| Lip ( a ), and analogously for e°(X,Z). 

A classical theorem of Minty |Min70| asserts that if H is a Hilbert space then e a (X, H ) = 1 for 
every metric space (X,dx) and a € (0,1/2] (equivalently, e“(^oo,^ 2 ) = 1 for all n € N). Minty’s 
theorem fails when a € (1/2,1] (see |HW71| ). but understanding what happens when a € (1/2,1] 
remains a mystery. Specifically, Kalton conjectured in IKal04] that for every a € [1/2,1] and n € N, 


e a (£ooJ2)<n a -X 


( 2 ) 


This conjecture appears as Problem 11.3 in |Kal04jpl . See the discussion immediately following 
Problem 11.3 in |Kal04| for an interesting geometric application that © would imply. The esti¬ 
mate (J2D holds true when a = 1/2 due to Minty’s theorem, and also when a. = 1 by the fact that 
is y^n-isomorphic to combined with an application of the nonlinear Hahn-Banach theorem (see 
e.g. jBLOOl Lemma 1.1]). Thus ([2]) is a natural conjectural interpolation between Minty’s theorem 
and the nonlinear Hahn-Banach theorem. 


Kalton’s conjecture ([2j) remains an interesting open problem, and in Section 11.21 we formulate a 
conjecture whose validity would imply the validity of ([21). In the reverse direction, to the best of 
our knowledge its isn’t known whether ([2j) would be sharp. We therefore ask the following question. 


Question 2. Is it true that e“(-foo,^) ^ 2 f° r every a € (1/2, 1)? 


We also ask the following variant of Question [2] for the parameter e“(-£oo, £ 2 )- 

Question 3. Is it true that e“(£oo ; ^ 2 ) ^ (logn ) Q- 2 for every a € (1/2,1)? 

By mimicking an argument of [JL84] one sees that a positive answer to Question [3] would imply 
a positive answer to Question [2] Indeed, for every metric space (A, dx) and a € (0,1] we have 

e^n(X,£ 2 )^3e a (£ 00 ,^). (3) 

(The choice of constants in ([3]) is somewhat arbitrary; see below.) Since ([3]) is not stated explicitly 
in [JL84], we shall now briefly explain how it is proved. Fix S Cl with IS) ^ 2' n and f : S —>■ £ 2 . 
Since /(S) is a subset of a Hilbert space of cardinality at most 2 7n , by the Johnson-Lindenstrauss 
dimensionality reduction lemma |JL84j there exists a mapping g : f(S) —>• £2 such that 

Vo,6e/(S), \\a-b\\ 2 ^\\g(a) - g(b)\\ 2 ^3\\a-b\\ 2 . (4) 

7 We note in passing that there is a misprint in [Kal04l Problem 11.3]: in the two displayed equations that appear 
there the exponent a erroneously appears in the left hand side of the inequality rather than in its right hand side. 
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The specific parameters in the Johnson-Lindenstrauss lemma that we take here are valid due to 
the bounds in |Ach03l Theorem 1.1]. By the Kirszbraun extension theorem ]Kir34| . 


3 g 1 :£%—>■ £2 such that g 


-1 


g°f(S) 


= g 


-1 


and 


,-1 




-11 


Lip 


I Lip 


< i, 


(5) 


where the last step of © uses the left hand inequality in (pf|). Also, by the definition of e“ (£qo ; £■■ 


2 h 


3 g o f : X —> £2 such that go f = g o f and go f 

s 

Thus, the following diagram commutes. 


Lip(o) 


^ e a {£oo, @ 2 ) \\g o f\\up(a) ■ ( 6 ) 



The mapping g 1 o g o f : X —>• £2 therefore extends / and satisfies 


g 10 go f 


g- 1 


go f 


Lip(a) 


Lip 



Lip(a) 


©A© 

< e a (^oo,^)lbo/|| Lip(a) 


^e a (£ 00 ,£^ 


Lip||/||Lip(a)^3e“(^ 00 ,^)||/|| Lip(a) , (7) 


where the last step of (© uses the right hand inequality in (pfj). Clearly © implies (©. 

The link between Question [2] and Theorem [T| is twofold. Firstly, Question [2] presents another 
situation in which the linearization strategy of jJL84j for proving extension lower bounds seems 
to be insufficient, because bounded linear maps are Lipschitz rather than a-Holder for a G (0,1). 
Nevertheless, in Section 11.21 we do show how to use a linearization procedure in the spirit of |JL84) 
to bound e"(£ 00 ,^ 2 ) from below. However, our approach requires a substantial modification of the 
argument of fJL84j and at present we do not see how it could yield a nontrivial result for the entire 
range a £ (1/2,1] (the argument of Section f4.21 yields nontrivial bounds only when a ^ ljy/2). The 
second link between Question [2] and Theorem Q] is that the bounds that we obtain in Theorem [4] 
below are based on a modification of a construction of |.TLS86| which is part of a general family of 
constructions that were used in related contexts also in |Lan991 ICKR05| , and to which the example 
that underlies Theorem |T] belongs as well. We elaborate further on this in Remark fl9l below. 

Here we obtain the best known lower bounds in the context of Question [2] and Question [3] 


Theorem 4. For every a € (1/2,1] and every n € N we have 

e“(4o,^2)>n^+n“ 2- 5 and e%(£ 00 ,£ 2 ) > (logn) 2 ^ + 


( log n 
\log logn 




( 8 ) 


Importantly, the lower bounds in (© tend to 00 with n for every a € (1/2,1]. A positive answer 
to Question [2] (respectively, Question© for a = ^ +e would yield the lower bound e a (£oo,£%) > n e 
(respectively, e‘^{£ 00 ,£ 2 ) > (logn) £ ), while Theorem © implies that 


e^ +£ ( 


'OO) t-2 


£V 


> 


n 


e-2e A 


and el (4o,^) > (log n) 


e—2e z 
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i.e., the exponents of Theorem 0] match what Kalton’s conjecture predicts up to lower order terms 
as a —>• l/2 + . The exponent a 2 — 1/2 in the second summands in ([8]) becomes positive only 
when a ^ 1/V% and it becomes greater than the exponent of the first summands in (j5|) ? namely 
(2a — l)/(4 a), only when a £ (cko, 1], where ao = 0.837... is the largest root of the polynomial 
4x 3 — Ax + 1. As a tends to 1 the exponent a 2 — 1/2 tends to 1/2, so the bounds in (JHD yield 
an alternative possible interpolation between Minty’s theorem and the nonlinear Hahn- Banach 
theorem. We do not believe that this interpolation is sharp and it seems more likely that Kalton’s 
conjecture, and correspondingly Question [2] and Question [3l have positive answers. It may very 
well be the case that the example that we analyse in Section 14.21 is itself an example that yields a 
positive answer to Question EJ but at present we do not know how to prove this. 


1.2. A conjectural strengthening of Ball’s extension theorem. A longstanding open prob¬ 
lem posed by Johnson and Lindenstrauss in [JL841 Problem 1] asks whether or not for every 
p £ (1,2) and every n € N we have 

e(W2) ~p nv~^. (9) 

Here, and in what follows, the notation A < p B stands for A ^ C(p)B , where C(p ) £ (0, oo) is 
allowed to depend only on p. The notations A> p B and A B are defined analogously. 

Johnson and Lindenstrauss were motivated to ask whether ({9]) holds true by a Lipschitz extension 
theorem of Marcus and Pisier (MP84j . which states that for every p £ (1,2) and n £ N we have 

en(£p,t2) < P (logn)WT (10) 


Specifically, the validity of ([9]) would yield a new proof of the Marcus-Pisier theorem (fTOl) through 
an application of the Johnson-Lindenstrauss dimensionality reduction lemma (we recalled how 
such arguments are carried out in the proof of Q above). Thus, the validity of ([9j) would yield 
an illuminating new perspective on the work of Marcus and Pisier |MP84j . who proved that (1101) 
holds true via an entirely different argument. 

Our goal in this section is to formulate a conjecture that implies both Kalton’s conjecture on 
the validity of ([2]) and the Johnson-Lindenstrauss conjecture on the validity of ((9|), in addition 
to a wealth of yet unknown Lipschitz extension results. In order to do so, we need to quickly 
recall Ball’s work |Bal92j (itself addressing another open question that was posed by Johnson and 
Lindenstrauss in jjL84j ) on the nonlinear version of Maurey’s extension theorem |Mau74| . 

For p,q £ [l,oo], the Rademacher type p constant and Rademacher cotype q constant of a 
Banach space (X, || • ||x), denoted T p (X) and C q (X), respectively, are defined to be the infirna of 
those T, C £ [1, oo] such that for every n £ N and every x±,... ,x n £ X we have 


1 

2A 


E 

ee{-l,l} n 


n 


E Z 3 X 3 

3 = 1 


P 

X 


< 


r p E 

3 = 1 


IIP 

lx 


and 


ElM* 

3 = 1 


E 

2 n / ^ 


y] e-jx 

3 = 1 


3^3 


Maurey’s extension theorem |Mau74j asserts that if ( X , || ■ ||x) and ( Z , || • \\z ) are Banach spaces and 
E C X is a linear subspace then for every linear operator U : E —>• Z there exists a linear operator 
V : X —>• Z that extends U and satisfies the operator norm bound ||Vj|x->-z 1$ T 2 (X)C 2 (Z)\\U\\e^z- 
In |Bal92| . Ball discovered a powerful nonlinear version of the Maurey extension theorem. In 
order to do so, he introduced notions of type and cotype for metric spaces that have since proven 
to be useful in several contexts beyond their original use for the Lipschitz extension problem. For 
p £ (0, oo), the Markov type p constant of a metric space (A, dx), denoted M p (X), is defined to be 
the infimum over those M £ [l,oo] such that for every n,t £ N, every n x n symmetric stochastic 
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matrix A = (ay), and every x ±,..., x n G X we have 

n n n n 

)i j d x (x i ,x j ) p ^ tM p ££ OijdxtXi.XjOP. 

2=1 J = 1 2—1 J = 1 

For g G (0, oo), the metric Markov cotype g constant of a metric space (X,dx), denoted N P (X), is 
defined to be the inhmum over those N G (0, oo] such that for every to, t € N, every nxn symmetric 
stochastic matrix A = (a^), and every xi, ..., x n G X, there exist yi,..., y n G X such that 

n n n n n ^ t 

£ Vi? + t £ £ a ijdx(yt: Vjf < «” £ £ j £(-4* )ijdx(xi,Xj) p . 

2—1 2—1 J = 1 2—1 J =1 S =1 

Ball’s extension theorem [Bal92j asserts that for every metric space (X,dx) and every Banach 
space (Z, || • || z) we have 

e(X,Z**)<M 2 (X)N 2 (Z). 

This formulation is not the strongest known version of Ball’s theorem, but it suffices for our pur¬ 
poses; see MX IMal for a more complete discussion. 

Remark 5. In the setting of Ball’s extension theorem one actually needs to know the Markov type 
of the complement of the subset from which one wishes to extend rather than the Markov type of 
the entire ambient space. Namely, we actually have 

e(X,S,Z**) <M 2 (XxS)N 2 (Z). 

This assertion holds true also for the more general version of Ball’s extension theorem that was 
obtained in |MN13aj . The above stronger statement follows effortlessly from an inspection of the 
proofs in |Bal92l IMNl.Saj , though it has never been stated in the literature; we believe that it is 
worthwhile to record it here for potential future applications. 

A beautiful strengthening of Maurey’s extension theorem for finite-dimensional targets was dis¬ 
covered by Konig, Retherford and Tomczak-Jaegermann [KRTJ80] . who proved that for every 
p G [1,2], q G [2,oo) and to G N, if ( X , || • ||x) and (Z, || • \\z) are Banach spaces with dim (Z) = to 
and E C X is a linear subspace, then for every linear operator U : E —» Z there exists a linear 
operator V : X —>• Z that extends U and satisfies 

WWx^z < P , q T p (X)C q (Z)n^\\U\\ E ^ z . (11) 

See also the expository article of Pisier |Pis79) for an elegant proof of (|1 1 j) . Important earlier special 
cases of the above result (other than Maurey’s extension theorem itself) can be found (via different 
proofs) in the works of Lewis |Lew78| and Figiel and Tomczak-Jaegermann (FTJ79) . 

In light of (fill) , we ask the following question. 

Question 6. Suppose that 1 ^ p ^ q < oo and that ( X , dx) is a metric space with Markov type p , 
i.e., M p (X ) < oo. Suppose also that to G N and that (Z, || • \\z) is an n-dimensional nornred space. 
Is it true that there exists a constant K = K(M p (X), N q (Z),p,q ) G (0,oo), which may depend 
only on the parameters M p (X), N q (Z),p, g, such that 

e(X,Z) < Knv~\ ? 

Since, by a straightforward application of the triangle inequality, every metric space (A, d x ) has 
Markov type 1 with M\(X) = 1, for every a G (0,1] the Markov type 1/a constant of (X, dy) 
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equals 1. Also, by |Ball3L lMN13aj . for every q G (l,oo) we have N max T q 2 \{£q) < yg/(g—T). 
Hence, a positive answer to Question [6] would imply that for every n G N we have 


1 

max{g, 2} 


1 



( 12 ) 


The case q = 2 of m is the same as m, i-e., Kalton’s conjecture is a special case of Question El 
By [Bal92 , NPSSQ6], for every p G [l,oo) we have A7] r . j n {p. 2 [ ( £p ) V QV This implies formally 
(directly from the definition of Markov type) that for a € (0,1] the Markov type min{p, 2}/a 
constant of the metric space (£ p , ||a; — y||“) is at most a constant multiple of p a / 2 . Hence, a positive 
answer to Question El would also imply that for every p,q € (l,oo) and every n € N we have 


min{p,2} 
max{g, 2} 




< 70 min 

n 


TpTT 


1_ 

*{9,2} 


(13) 


The case p € (1,2), q = 2 and a = 1 of (fl3l) is the same as ©, i.e., the above conjecture of Johnson 
and Lindenstrauss is a special case of Question El The validity of ifT3|) would complement the fact 
that e a (£ p ,£ q ) < 00 for every a £ (0, min{p. 2}/ max{g,2}], as shown in jNaoOl . NPSS06]. 

At present we see several obstacles to adapting the proofs in |Bal92L MN13a j so as to incorporate 
the finite dimensionality of the target in order to answer Question El We therefore leave Question El 
as an intriguing direction for future research, itself part of the Ribe program (see |Ball3L INaol2| ). 


1.3. Lipschitz extension between £ p spaces. Theorem Q] yields the best known lower bound on 
ae(n) as n —>• 00 , but there are several cases of special interest where the best known lower bound 
on e n (X,Z) is o(y / log n). Here we recall the best known bounds when X = £ p and Y = £ q for 
p,q G [l,oo], as a survey of very basic questions on Lipschitz extension that remain open. 

The case of Hilbert space-valued functions was famously studied by Johnson and Lindenstrauss 
in [JL84j . answering a question posed by Marcus and Pisier in |MP84| . The bounds on e n {£ 00 ,£ 2 ) 
that were obtained in [JL84] are as follows, and they remain the best known bounds to date. 


Vlogn 
Vlog log n 


< e n (£ 00 ,£ 2 ) < V l °S n - 


(14) 


In the special case of Hilbert space-valued functions defined on finite subsets of £ p for p G (1,2), 
the best known bounds are 


( log n 
\ log log n 


< 




< p (log n)t 


(15) 


The rieht hand inequalitv in (fT5l) is due to IMP.'' Ij and the left hand inequalitv in (USD is due 
to IJL84] . where it is shown to hold for every pG [1,2]. 

As shown by Makarychev and Makarychev in [MMlOj . the parameters e n (^oo,^i) and e n (£i,£i) 
have a special algorithmic significance. The best known bounds for these quantities are 


and 


Vlogn 
Vlog log n 

Vlogn 


< 


(£ 00 ; £ 1 ) 


< 


t (£i,£i) 


< 


logn 
log log n ’ 


log n 


(16) 


(17) 


i i 1 */ \ J- " J- / r^j i l 

log log n log log n 

The right hand inequalities in (fT6l) and (1171) are a special case of the general upper bound of |LN05| . 
The left hand inequalities in (1161) and (1171) appear in [MMlOj . with the left hand inequality of (1171) 
being based on the work of Figiel, Johnson and Schechtman |FJS88| (an asymptotically weaker 
lower bound in (|17D follows from earlier work of Bourgain }Bou81| ). 
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By Ball’s extension theorem [Bal92| . combined with the fact [NPSS06] that £ p has Markov type 2 
when p G [2,oo), ifl<g^2^p<oo then e(£ p ,£ q ) ^ C{p,q) for some C(p,q ) G (0,oo). Here the 
asymptotic dependence of C(p, q) as p —>• oo or q —>• 1 remains unknown. In particular, a famous 
and longstanding open question of Ball [Bal92| asks whether e(^ 2 >^i) is finite or infinite. The best 
known bounds on e n (£ p ,£ q ) for the remaining values of p, q G [l,oo] are as follows. 


p,qe [1,2] and q / 

1 2 : 

p,q£ (2, oo) = 
l^p^2^g<oo == 


/ log n \p 2 
\ log log n 


q S-n{£pi £q) iSp (log Tl) p , 


I_I 
(log n)p 2 


(log log n) p 


r S e n (£ P ,h) < P (log n)p. 


( lp g 


n ^ <e U £)< l ° gn 

rsj c 'n\ K 'P') ^q) 


\ log log n 

/ l \ maxi — — 

I logn \ IP 2 q 2 J 

\ log log n 


log log n ’ 


&n(^pi £q) (log Tl) P . 


(18) 

(19) 

( 20 ) 
( 21 ) 


The right hand inequalities in (|18j) . (|19j) . (|21l) are due to [LN05j . More generally, it was shown 
in [LN05| that for p G (1, 2] we have e n (£ p , Z) < p (log n) l ^ p for every Banach space Z. It would be 
interesting to determine the best exponent of log n in this context of general Banach space targets, 
as well as when the target Z is allowed to range over some special classes of Banach spaces (e.g., 
for Banach spaces of cotype 2 it is known [MNl.'laj that this exponent cannot be smaller than 1/4). 

No upper bound on e n (£ p , £ q ) that is asymptotically smaller than the general upper bound [LN05j 
of 0((logn)/log logn) is known when p,q G (2, oo), and similarly for e n {£\,£ q ) for q G (2,oo). 

Let (Z, || • || z) be a Banach space that has Rademacher type p for some p > 1. By a theorem 
of Figiel and Tomczak-Jaegermann [FT.T79| there exists K(Z) G (0, oo) such that for every n G N 
one can find an n-dimensional subspace Z n of Z that is 2-isomorphic to R) an d there exists a 
projection from Z onto Z n of norm at most K(Z). This implies that for any metric space X we 
have e n (X,Z) >z e n (X, £ 2 ). In particular, e n (Z,£ q ) > q e n (X, £ 2 ) for every q > 1. This implies the 
validity of the left hand inequality in (fl8l) . as a consequence of the left hand inequality of (fT5l) . 
However, this reasoning does not apply when the target space is £\, i.e., in order to prove the 
left hand inequality in (fl9l) . For this purpose, one argues by adapting the proof of the left hand 
inequality in (fl7l) that appears in [MMlOj (the key tool being [F.IS88] ): the adaptation of this 
argument is simple and we omit it (the result itself, however, is far from trivial). 

If ( X , || • ||x) is an infinite dimensional Banach space then, by Dvoretzky’s theorem [Dvo61l |. X 
contains a 2-isomorphic copy of £\] for every n G N. Consequently, e n (X,Z) > e n (£i, Z) for every 
metric space Z. In particular, the left hand inequality in (1201) follows from the special case p = 2 
of the left hand inequality in (1211) . The latter inequality consists of two lower bounds, one with 
the exponent 1/p — 1/2 and the other with the exponent (q — p)/q 2 . Since, as explained in the 
previous paragraph, e n (£ p ,£ q ) > q e n (£ p ,£2 ), the lower bound with exponent 1/p — 1/2 follows from 
the left hand inequality in (fl5l) . The lower bound with exponent (q — p)/q 2 is due to [Nao SH, with 
the explicit asymptotic dependence being computed when p = 2 in [LN051 Lemma 1.13], and when 
p G (1, 2) the corresponding bound follows mutatis mutandis by the same argument. 

It is worthwhile to point out here an interesting feature of the left hand inequality in m- 
Suppose that p G [1, 2|. For every n-dimensional subspace Y of £ p there exists a projection from £ p 
onto Y of norm at most c^n 1 ^" 1 / 2 . This assertion follows from the work of Lewis [Lew78j . and it is 
also a consequence (ED- Thus, for every Banach space Z, every linear operator U : Y —>• Z can be 
extended to a linear operator V : £ p —» Z with ||R||^ ?i 1//p ~ 1//2 ||b / )|y->.z. Consequently, the 



































linearization method of Johnson and Lindenstrauss [JL84j cannot yield a lower bound on e n (£ p , Z ) 
that is at least a constant multiple of (logn) 1//p-1 / 2 . However, ( q~p)/q 2 > 1 /p — 1/2 if and only if 


3 (l-V2p-3)“ (1- y/2p-3Y 

2 l + y/2^3 1 2(2 — p) 

So, if p, q satisfy (f22ll then the left hand inequality in (1211) shows that 


( 22 ) 


lim 6n( £H = oo, (23) 

n “* x> (logn)?"3 

i.e., we have a lower bound on e n (£ p ,£ q ) that is asymptotically larger than any bound that can 
be deduced by reducing the problem to the extension problem for linear operators. We conjecture 
that the restrictions in (f22ll are not needed here, i.e., (1231) holds true whenever 1 ^ p ^ 2 < q < oo. 
It even seems to be unknown whether for every p € [1,2] there exists a Banach space Z for which 


lim 

n—> oo 


*(e P ,z) 


= oo. 


(24) 


(log n)p 2 

The best-known result towards (1241) seems to follow from |MN13aj . where it is shown that there 
exists a Banach space Z for which e n (l2,Z) > ^/(log n )/ log log n. Hence, by Dvoretzky’s theo¬ 
rem [Dvo m , for every infinite dimensional Banach space X we have e n (X, Z) > yj (log n )/ log log n, 
implying that (1M1) holds true if p G [1,2] satisfies 1/p — 1/2 < 1/4, i.e., for every p € (4/3,2], 


2 . Algorithmic clustering 


An algorithmic optimization problem called 0 -Extension, which we describe below, served as 
inspiration for our proof of Theorem [0 In this section we shall survey this background so as to 
clarify the context and explain the “twist” over the existing approaches to 0-Extension that we 
introduce in order to prove Theorem [T] We stress, however, that our present work does not have 
new algorithmic implications, and the sole purpose of this section is to explain how this context 
motivated our approach to Theorem |T] Those who are interested only in the proof of Theorem |T] 
can skip this section on first reading: the proof itself appears in Section [3] below and is entirely 
self-contained, with none of the algorithmic background that we describe here being used. 

In what follows, the vertices of a combinatorial graph G are denoted Vq and its edges are denoted 
Eg- The 0-Extension problem is a clustering framework for finite graphs that was introduced 
by Karzanov IKar98| . The input of the 0-Extension problem is a graph G with edge weights 
w : Eg —» [0, oo), and a subset T C kg equipped with a metric dr : T x T —» [0, oo). The subset T 
is called in the literature the set of terminals. The output of the 0-Extension problem is a partition 
of Vq into |T| subsets {C x C Vg} x &t with the requirement that x € C x for every x € T. The cost 
of this partition is defined as follows. Every edge {u, u} € Eg with |{u, v} n C x \ = |{u, v} PI C y \ = 1 
for some distinct x,y € T contributes its weight w(u,v) times drr{x,y) to the total cost, and all 
other edges (i.e., edges that are entirely within one cluster C x for some x € T) do not contribute to 
the total cost. The goal is to find efficiently (in polynomial time) such a partition with minimum 
cost, or with cost that is guaranteed to be close to the minimum possible cost. Formally, define 


and 


COSTj-fj ^ ^) {{C x } x£ t) = f £ £ w(u,v)d T (x,y), 

x,yeT {u,v}eE a 

x+y |{u,u}nC !I |=|{u,u}nCy|=i 


OPT (G,w,d T ) 


def 


min 

{C x } xe T partition of Vg 
V xGT, x&C x 


COST {G , w , dT ) ({C x } x zt) ■ 


(25) 
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The goal is to find efficiently a partition {C x } x& x of Vq with x G C x for every x G T such that 

COST (G;W!dT) ({ C x } xeT ) 

OPT(G, w, d T ) 


is guaranteed to be at most a (hopefully small) value a G [l,oo). 

Obtaining a = 1 here would imply that P = NP, since in the special when w(u,v ) = 1 and 
dr(x, y) = 1 for every {it, u} G E G and every distinct x,y € T, the quantity COST^^^) ({CxIxgt) 
is nothing more than the total number of edges that are incident to distinct elements of the partition 
{Cxjxer- Computing the minimum cost in this special case is the MULTIWAY CUT problem, and 
it was shown in jDJP + 94 that if P ^ NP then there exists ao > 1 such that no polynomial¬ 
time algorithm outputs a partition whose cost is guaranteed to be less than «o times the minimum 
possible cost. For the 0-Extension problem in its full generality, it was shown in [KKMR09] that for 
every e G (0, 1) there exists C{e) G (0, oo) such that the existence of a polynomial-time algorithm 
for 0-Extension whose approximation factor on instances of size n is a ^ (log n) 1 / 4-6 would imply 
that any problem in NP of size n could be solved in time exp ((log n) c ^ £ )y 


Due to the above evidence for the nonexistence of a constant-factor approximation algorithm for 
0-Extension, the literature has focused on the design of approximation algorithms with potentially 
unbounded approximation factor, based on two competing continuous relaxations of the discrete 
optimization problem. The first is the metric relaxation , which was formulated in [Kar98| and 
studied extensively in |CKR05| . Given an instance of 0-Extension, i.e., an n- vertex graph G, a 
metric dx '■ T x T — >• [0, oo) on a subset T C Vq, and edge weights w : Eq —» [0, oo), define 


MET (G,w,d T ) 


def 


min 

d: Vq X Vq [ 0 ,oo) 
d\ r p=dj' 

d is a semi—metric 


E 

{u,v}eE G 


w(u, v)d(u, v). 


(26) 


The minimization in the right hand side of (126(1 amounts to a linear program because the constraint 
that d : Vq x Vq —>• [0, oo) is a semi-metric that extends dx corresponds to 0(n 3 ) linear inequalities 
in the variables {d(u,v)} UjV ^v G - Therefore MET (G,w,dx) can be computed in polynomial time. 

A second relaxation for the 0-Extension problem, called the earthmover relaxation, was proposed 
in (CKNZ05] and studied extensively in [AFH + 04[ IKKMR09] . The idea is similar to (1261) , except 
that the semi-metric d : VqX Vq —> [0, oo) is further restricted to a special class of semi-metrics, over 
which the corresponding minimization can still be cast as a linear program. Given two measures 
y, v on T with the same total mass, i.e., y(T) = n(T), let H(y,u) be the set of all couplings of 
y and u. Thus, n(/i, u) consists of all the measures tt on T x T such that 7r ( x > v) = ^( x ) 

and Yly^x x ) = f° r every x G T. The assumption that y and v have the same total mass 
ensures that Il(y,v) / 0; specifically (y x u)/y(T) G II {y,v). The Wasserstein 1 distance between 
y and v (also known as the earthmover distance between y and v) is defined to be 

W? T (y,v) d = min V dr(x,y)ir(x,y). (27) 

Let 7x denote the set of all probability measures on T and define 


EMD (G,w,dx) 


def 

= mm 

V xET, fj, x =S x 


^ w(u,v)W^ T {y u ,y v ). 

{u,v}&E g 


(28) 


Here for x G T the point mass at x is denoted <5^. The minimization in the right hand side of (128(1 
also amounts to a linear program, with variables corresponding to couplings tt uv of the probability 
measures y u and y v for each {u, u} G E G , and objective J2{ U}V } £ e g Z )x, y eT w ( u > v )dr(x, y)n uv (x, y); 
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the requirements of being a probability measure or a coupling are clearly linear constraints. The 
quantity EMD(G, w, dx) can therefore be computed in polynomial time. 

If {Hu}uev G — ^T are such that fx x = S x for every x € T then d(u,v ) = W^ T (n Ul /j v ) is a serni- 
rnetric on Vq that extends dx- Consequently, the minimization in (l28l) is over a smaller set than 
the minimization in (1261) . At the same time, if {C X } X £X is a partition of Vq with x € C x for every 
x € T, then by defining /i u = 8 X for every (u, x) & Vq x T we see that the minimization in ([25]) is 
over a smaller set than the minimization in (l28l) . These observations show that every instance of 
0-Extension satisfies 


MET(G, w, d T ) < EMD(G, w, d T ) < OPT(G, w, d T ). (29) 


Building on ideas of [CKR05I . it was shown in [FHRT03] that every instance of 0-Extension 
satisfies OPT(G, w, dr)/MET(G, w, dx) < (log |T|)/log log |T|. This yields the best known ap¬ 
proximation algorithm for the 0-Extension problem, the algorithm being to compute the quantity 
MET (G,w,dx), i.e., to apply the metric relaxation. It follows from (1291) that by using the earth- 
mover relaxation, i.e., by computing EMD(G, w, dx), one could potentially obtain an even better 
algorithm. This was realized in [AFH + ~04] in special cases (e.g. when (T, dx) is a planar graph), 
but in [LN03] it was shown that the same approximation guarantees as in [AFH + 04| can be ob¬ 
tained by using the metric relaxation. Nevertheless, in [KKMR09] it was shown that in a certain 
sense the earthmover relaxation of 0-Extension does perform better than the metric relaxation: the 
earthmover relaxation behaves better than the metric relaxation if one measures the approximation 
factor in term of the ratio between the largest distance and the smallest nonzero distance in (T, dx)- 
On the negative side, it was shown in |CKR,05j that there exist instances of 0-Extension with 
|T| arbitrarily large and OPT(G, w, dr)/MET(G, w, dr) > ydog \T\. More recently, in [ KKMR09 ] 
it was shown that there also exist such instances with OPT(G, w, dT)/EMD(G, w, dr) > i/log |T|. 


As described above, the earthmover relaxation of 0-Extension is in principle better than the 
metric relaxation, but at present the general bounds that are available in the literature are to a 
large extent the same for both relaxations. Nevertheless, our approach to Theorem [I] is based on 
exploiting the differences between the metric relaxation and the earthmover relaxation. The idea is 
that if an instance of 0-Extension behaves significantly worse for the metric relaxation than for the 
earthmover relaxation then the terminal metric (T, dx) can be embedded isometrically into a larger 
semi-metric (Vc,d) for which Yh{ uv \£E G w ( u i v)d(u, v) is much smaller than OPT (G,w,dx), yet 
whenever one assigns to every vertex u € Vq a probability measure /i u € (Pt such that [i x = S x for 
every igl, then JN v}ge g w ( u i v )W[ It must be close to OPT(G, w, dx)- This discrepancy 

between the two relaxations can be used to show that the mapping that assigns to every i 6T the 
point mass 5 X € Tt cannot be extended to a function from {Vq- di) to (Tr, Wf T ) that has a small 
Lipschitz constant. Here the target space (“Pt, W-f T ) is not a normed space, but this idea can be 
modified so as to yield a poorly extendable function with values in a related normed space, namely 
the dual of the space of real-valued Lipschitz functions on (T, dx) that vanish at a fixed point. In the 
end, the entire argument as presented in Section [3] uses geometric and combinatorial considerations 
that are self-contained and do not make any reference to the clustering objective OPT(G, w, dx)- 
The examples of metric spaces that we use to prove Theorem [T| are modifications of the examples 
that were considered in [CKR05] . which are themselves in the spirit of an example that was used 
in [JLS86 ] (and, we use yet another variant of the example of [JLS86 ] to prove Theorem 0]). 


3 . Proof of Theorem □ 

Before passing to the proof of Theorem [lj which appears in Section 1.3 .1 1 below, we need to recall 
some (simple) background and to introduce some basic constructions. 


li 



























3.1. The r- magnificat ion of a metric space. Given a metric space ( X , dx ) and r > 0, for every 
subset SCI we shall define a new metric space X r (S ) that we call the r-magnification of ( X , dx) 
at S. As a set, X r (S) equals X. The new metric dx r (s) 011 X is defined by setting dx r (s){ x i x ) = 0 
for every x G X, and by defining for every distinct x, y G X, 

rip f 

dx r (S)( x ,y ) = dx( x ,y) +r\{x,y} n S\. (30) 

It is immediate to check that dx r (S) is indeed a metric on S. Note that this construction increases 
all the positive pairwise distances within S while keeping the pairwise distances within X \ S 
unchanged. This is the reason for our choice of terminology. If G is a connected graph then do 
will always stand for the shortest-path metric on Vq, and for S C Vq and r > 0, the metric space 
G r (S ) will always be understood to be the r-magnification of ( Vc,dc ) at S. 

The relevance of the r-magnification of (X,dx) at S C X to potentially proving impossibility 
results for Lipschitz extension is simple to explain. By making the positive pairwise distances 
within S larger, we make it easier for functions that are defined on S to be Lipschitz, so there are 
more “potential counterexamples” on {S,dx r (s)) than there were on ( S,dx )■ At the same time, by 
keeping the pairwise distances within X \ S unchanged, we do not make the Lipschitz condition 
on V \ S any less stringent. However, there is a limitation to this reasoning because as r becomes 
larger the minimum positive distance in (S,d Xr (s)) becomes closer to the diameter of (S,d Xr (s))i 
in which case bounds on ae(e) become relevant. Indeed, by (l30l) any distinct x,y G S satisfy 

2 7" —|— \ 

d.Yr(s)(x, y) > 2r + 1 = _ i _^_ y diam(S,d Xr(s) ). (31) 

Since ac(e) < 1/e, it follows that any 1-Lipschitz function from (. S,dx r ts )) to any Banach space can 
be extended to a function defined on (X, d\ r (s)) whose Lipschitz constant is at most a constant 
multiple of (2 r + diarn(5, d_\'))/(2 r + 1) ^ 1 + diam (S,dx)/r. There is therefore a tradeoff that 
limits how large r could be if one wishes to use the r-magnification for the purpose of obtaining 
a lower bound on ac(n). Below we shall balance these constraints, for an appropriate choice of an 
initial metric space ( X, dx ), so as to yield Theorem [0 

3.2. The Wasserstein 1 norm. For a finite set X we denote (as usual) by the | A|-dimensional 
vector space of all / : X —>• R. We also denote by Mj/ the subspace of consisting of those 
/ : X —>• R that satisfy Ylxex f ( x ) = 0- The standard basis of R A will be denoted by {e x } x£ x, 
i.e., e x (y) = l{ x=y } for every x,y G X. For / G R A we shall use the (standard) notation 

ll/lkpo d = l/( x )l- 

xex 

If (X,dx) is a finite metric space then let K( X ,d x ) — following convex hull. 

K (x.dx) = f conv | dx(x 6 y) 1 X ' V G ^ and X ^ y } ’ 

K(x,d x ) clearly an origin-symmetric convex body, so it is a unit ball of a norm on R A , called the 
Wasserstein 1 norm induced by X , which we denote by || • || Wi(x,d x ) ■ the Kantorovich-Rubinstein 
duality theorem (see [Vil03l Thm. 1.14]), denoting as usual f + = max{/, 0} and = max{—/, 0} 
for every / G R‘ A , and recalling (1271) . we have 

V/ G Rg , ll/ll Wi(x,dx) = = inf V dx{ x ,y)TT(x,y). (33) 

7ren (/+./ 


12 






Observe that this makes sense because the assumption / € Mff. i.e., that Ylxex f( x ) = Oj implies 
that the nonnegative functions f + and f~ satisfy, Ylxex f + ( x ) = YIxgx S°> f + and f~ are 

nonnegative measures with the same total mass. 

We record for future use the following very simple lemma. 

Lemma 7. Let (X,dx) be a finite metric space. Then \\e x — Cy\\w^(x,d x ) = dx{x,y) for every 
x, y £ X and 

V/ € Mff, ^ mu^dx{x,y)\\f\\ hi x) \\f\\wi(x,dx) < ^diam(X)||/|| 4(x) . (34) 

x^y 

In particular, for every r > 0 and S C X we have 

V/6K», r||/||« l(s) « ||/|k l(SAr(s ,) « (>- + ^®)ll/lk (S) . (35) 

Proof. The fact that \\e x —e y \\w 1 (X,dx) = dx(x, y) for every distinct x, y E X is immediate from (J33|), 
and also directly from ([32l) . which says that (e x — e y )/dx(x,y) has unit norm in W\ (X, dx). To 
prove ([Ml) , denote by m the minimum nonzero value that dx attains. For distinct x,y € X, 


This means that for every distinct x,y € X we have (e x — e y )/dx (x, y) € (2/m)Bi 1 ^ X ), where ( X ) 
is the unit ball of £\{X). By (IMll we therefore have K( X ,d x ) — which is the same as 

the first inequality in ([Ml) . The second inequality in (1341) follows directly from (1331) . noting that 
the sum in the right hand side of (l33l) is at most diam(X) XLex f + ( x ) = diam(X)||/||^ 1 p s q/2. The 
estimate (I35p is a special case of ([Ml) because by ()30l) the minimum nonzero distance within S of 
the r-magnification of X at S' is at least 2r, and the diameter of X r (S ) is at most 2r + diam(X). □ 

3.3. Properties of expanders. In the proof of Theorem Q] we shall use several properties of 
graphs in general, and expander graphs in particular. Here we collect these facts for ease of later 
reference. Fix two integers n,d ^ 3 and let G be a connected n-vertex d-regular graph. Recall that 
the shortest-path metric that G induces on V G is denoted do- The following estimate is standard. 

V0 # S' c Va, X •£ da(x,y) > (36) 

To quickly verify the validity of ()36l) , observe that since the smallest nonzero distance in G is at least 
1, the average that appears in the left hand side of (1361) is always at least |5|(|5| — 1)/|S'| 2 = 1 —1/|5*|. 
One checks directly that 1 — 1/a ^ (log a)/(4 log 3) for every a € {1,..., 15}, so in order to 
establish (1361) we may assume that |S| ^ 16. Denote k = 1 + |_log rf (|-S'!/4)J and observe that for 
every x £ Vq the number of y € V G with d G {x, y) ^ k— 1 is at most l+d +.. .+d k ~ l ^ 2d fc_1 ^ \S\/2. 
The average that appears in the left hand side of (1361) is therefore at least k/2, and it remains to 
note that k ^ log rf (|S'|/4) ^ (log |£|)/(21og d), since |5| ^ 16. 

Fixing S C V G and r > 0, we shall need later the following straightforward evaluation of the 
average length of edges of G in the r-magnification of (G,d G ) at S. 

fW ~\ d Gr{s) (x,y) = l + ^p-. (37) 

1 G| {x,y}GE G 

Indeed, let E\ be those edges in Eq that are contained in S and let E 2 be those edges in Eq 
that contain exactly one element of S. Because G is d-regular, we have 2|F?i| + \E 2 \ = d\S\. 


max 

c,y£X 


r. 


dx(x,y) 


^ max 
(X) x ’V^ X 


\ e x e y IUi(X) _ 2 
m m 
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Recalling (l30lh for every {x,y} G E\ we have d.G r (s) ( x , y) = 2r + 1, for every {x,y} G -E 2 we have 
dGr(S) ( x ,y) =r + 1, and for every {x, y} G Eg \ (Lj U E 2 ) we have d Gr (s) (x, y) = 1. Consequently, 


-WJ X d GAS)(x,y) 

(x,y)eE G 


(\Eg\ - l^il - \E 2 \) + (2r + l)|£i| + (r + 1)|£ 2 | 

I^Gl 

r(2\E\\ + |-E 2 |) 2r|S| 

dn/2 n 


Given two disjoint subsets 5, T C Vq, denote the number of edges in Eg that intersect both S 
and T by Eg(S,T). The edge-expansion of G, denoted 4>(G), is the largest f> G [0, 00 ) such that 

VSCRc, E G (S,V G ^S) ^ ^ n ~^ \E G \. (38) 

n z 

It is well known that ([38]) is equivalent to the assertion that every h : Vq i\ satisfies. 

^2 X IW®) - h (y)h < tJ-t X) IIM*)( 39 ) 

x,yeV G G {x,y}£E a 

The equivalence of (1381) and (1391) is a standard application of the cut-cone decomposition of subsets 
of £i; see e.g. inequality (4) in |Mat97| or [NRS05L Fact 2.1]. The following simple combination 
of (1391) and Lemma [7] will be used later. 


Lemma 8. Fix n G N and <f> G (0,1]. Suppose that G be an n-vertex graph with 4>(G) (f>. For 

every 0/5 C Fg and r > 0, every F : V G —» Mq satisfies 


1 


n 


2 


X F (y)llwi(S,d GT . (s) ) < 

x,yev a 


2 r + diam(5, d G ) 
(2r + 1)0 


—— X H F ( X ) ^'(?/)lliy 1 (S,d Gr( s ) ) • 

1 1 {x,y}&E G 


Proof. By (]39l) we have 

4 •£ ||F(x)-F(!,)|| tl(s) «-i- £ IIFM - Ffe)ll, l(s , (40) 

x,y&V G G| {a:,y}eE G 

Recalling (]30|). we have diam(5, d Gr (S)) = 2r + diam(5, d G ) and the smallest positive distance in 
(5,d Gr .( 5 )) equals 2r + 1. So, by Lemmad every x,y € R G satisfy 

4±! ||F(x) - Ffo)||, l(s) < ||F(x) - Ffe)ll m(s ,, G> . (s)) < 2r + d ‘7 ( ’ S,<iG) IIF(X) - F(y )|| (l(s) . 
Lemma [8] now follows by substituting these estimates for pairwise distances into (1401) . □ 


We end by recording for future use the following direct consequence of Menger’s theorem |Men27| . 

Lemma 9. Let G be an n-vertex graph and A,B C Vq satisfy A n B = 0. Fix <f> G (0, oo) 
and suppose that <f(G) / fi. Then the number of edge-disjoint paths joining A and B is at least 
0min{|x4|, \B\}\E G \/(2n). In other words, there exists an integer m / </>min{|x4|, \B\)\E G \/{2ri), 
ki,...,k m G N and {u i} i,u i)2 ,... ,u i}ki }Zh Q V G such that C A, {u i) k i } r fLi / B ■ More¬ 

over, {ui yS ,Ui^s- |_i} G Eg for every i G {l,...,m} and s G {1,..., ki — 1} ; and if for some 
i,j G {1,..., m}, s G {1,..., h — 1} and t G {1, 1} we have {u i)S ,u iyS+ 1 } = {ujj,u j}t + 1 } 

then necessarily i = j and s = t. 


Proof. Let m be the maximal number of edge-disjoint paths joining A and B. By the classical 
Menger theorem |Men27| (see also e.g. [DielQL Chapter 3]) there exists a subset of edges E* C Eg 
with \E*\ =?7i such that every path in G that joins a vertex in A with a vertex in B contains an 
edge from E*. Since in the graph G* = ( Vg,Eq \ E*) there is no path that joins an element of A 
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with an element of B, if we let C C V G be the union of all the connected components of G* that 
contain an element of A then C D A and C n B = 0. Since C is a union of connected components 
of G*, all the edges of E G joining C and V G \ C belong to E*. Hence, Eq(C , V G \ C) ^ |.E*| = m. 
It remains to note that by (|38l) we have 

„ ^ c tmr n\ ^ ,,max{|Cj,n - |Cj} • min{|Cj,n - |Cj} 0min{|A|, \B\}\E G \ 

m ^ E g (G, V g \ C) > (p - — 2 - \E g \^ - —-, (41) 

where the last step of flU]) holds true because clearly max{|Cj,n — |Cj} ^ n/2, and since C A A 
and V G \C3B we have min{|Cj,n — |Cj} ^ rriin{|A|, |L>|}. □ 


3.4. A Wasserstein-valued poorly extendable function. Fix d, n G N and < j> G (0,1). Through¬ 
out this section, G will be fixed to be an n-vertex d-regular graph with <f>(G) ^ (f). We shall also 
fix 0 S C V G and r > 0. Define a mapping 

/ = {S,d Gr ( S )) (®o> II • llwi(S,d Gr0s) )) 


by 




V x € S', f(x) d = e x - . 

' ' z&S 


(42) 


Thus / is an isometry (recall Lemma [7J). Suppose that F : V G —>• Mq extends / and for some 
L G (0, oo) we have 


Vx,y G V G , \\F(x) - F(y)\\ Wl{S4GAS)) ^ Ld Gr{s) (x,y). (43) 

Our goal is to bound L from below. 

For every x G Vq and s G (0, oo) define r B s (x) C V G to be the inverse image under F of the 
Wi{S,d Gr (g))-ball of radius s centered at F(x), i.e., 

23 s (x) ^ {y G V G : ||F(x) - F(y)\\ Wl{S , dGr(s)) ^ *} ■ 

By Lemma 0 there exists an integer 

m ^ ^ min {|S \ B s (x)| , |‘B s (x)|} , (44) 

and m edge-disjoint paths joining S \ 23 s (x) and l B s (x). This means that we can find ki,...,k m G N 
and Q V G such that {z jt lj'fLi CS \ r B s (x), { Zj^JLi Q ® s (x), and such 

that {{zj t i, Zj t i- |_i} : j G {1,..., m} A i G {1,..., kj — 1}} are distinct edges in E G . 

Let J C {1,... , m} be such that {zj t i}j^j are distinct and {zj.\}j^j = For every j G J 

denote the number of those i G {1,... ,m} for which zj,\ = z. L ,\ by dj. Since ^, 2 }}^! are 

distinct edges in E G , and G is d-regular, maxj^jdj ^ d. Because ^2 j£ jdj = rn. it follows that 

tii 1441 1 A \ 

\J\ > -j > -min{|5'\® s (x)| ,|S s (x)|} . (45) 

The following lemma provides an upper bound on | J\ that we will later contrast with (|45j) . 


Lemma 10. Under the above notation and assumptions we have 


\J\ G5 max 


d 16(s " r) , 


16 Lnd log d 
log n 



(46) 
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Proof. If | J\ ^ d 16 ' s r > then we are done, so we may assume that |J| > d 16 ^ s r \ or equivalently 

s — r < 


Hence, 


Y ll F (^>i) - 


’PWwdS^GAS)) 


2\J\ 


i,j£j 


^ 2|J| ^ F ( z Jd)Ww 1 (S, 

1 1 ijeJ 

^ -7177 Y d o(Zi,l,Zj,l) - (s — 01*^1 


log|J| 

16 log d 


(47) 

= f is an isometry on (5, d Gr(S )), 


) - d G r (S){ z i,l, z j 

4 ) ® 2 r + dG( 7 i, 7 ,i)- 

(48) 



(49) 

’■GAS)) + ll^^'d) 

~ F ( z i,kj) Hwi(S,d Gr(fl) )) 


3 r (S)l 

~ F ^Y\\w 1 (S,d Gr(s) )) 





2IJI 

1 1 i,jeJ 

\J\log\J\ 

8 log d 


- (s-r)|J| > 


16 log d 1 


(50) 

(51) 


where in (15U1) we used (1151) and the fact that, because {zj.fc, }j^j C “B s (x), it follows from the 
definition of B s (x) that for every i,j £ ,J we have 

\\F{Zi,ki) -F( z j,kj)\\Wi(S,dGr(s)) ^ \\ F ( z i,ki) ~ F ( x )\\wi(S,d Gr ( S) ) + \\ F ( x ) - FiZj^Wwi&dc^s)) ^ 2s ' 

The penultimate inequality in (1511) is an application of (1551) . and the final inequality in (1511) uses (pTTD . 

The quantity in (j49j) can be bounded from above using the Lipschitz condition ([43]) and the 
triangle inequality as follows. 

kj—l 

Y, ||-^( 2 J,l) _ F ( Z j,kj)\\Wi(S,d Gr ( S )) ^ L Y, d G r (S) ( z j, 1; Zj,kj ) ^ L Yj Yj d G r (S)( z j,i, z j,i+ 1)- ( 52 ) 

j&J r jeJ jeJ i =1 

Since {{zjj, 7,j+i} : j € J A z £ {1,..., kj — 1}} are distinct edges in Eg, 

EE^fe».is E ^)(».»)®y(i + ^)- 

jeJ i=l {WjfjSiJG ^ 2 


(53) 


A substitution of (1531) into fl52l) . and contrasting the resulting estimate with ([5T]) . yields 

Lnd f 2r|5|\ ^ |J|log|J| . 


n J 16 log d 


(54) 


It is elementary to check that if a € [l,oo) and b £ (l,oo) satisfy a log a ^ b then a ^ 26/ log b. By 
applying this with a = | J| and b = 8Lnd\ogd(l + 2r\S\/n) ^ n, we see that ([541) implies that 


\J\ < 


16nd log d 
log n 


i + ^). 

n / 


thus completing the proof of (l46l) . 


□ 
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Corollary 11. Suppose that the following conditions hold true. 


d ns-r) / 


0 |S| 


Then 


and L < 


max |'B s (x)| < 
xGVg 2 


(/>|<S| logn 


128 (1 + ^77^ ) ndlogd 


(55) 


(56) 


Proof. Fix x € V G ■ If T> s (x) n S / 0 then it follows from (l36l) that there exist y, z € r B s (x) n S with 

log l'Bs(x) n s\ 


d G (y,z) > 


41ogd 


(57) 


At the same time, since y,z € S and F\s = f is an isometry on (S, d Gr (s)), we know that 

<h{y,z) + 2 r ® d Gr ( S )(y,z) = \\F(y) - F(z)\\ Wl (s,d GAS) ) 

< ll^(y) - F{x)\Iw^dGAs)) + - F ^\w 1 (S,d GAS) ) < (58) 

where in the last step of (1581) we used the fact that y,z € "B s (x). Contrasting (IFF)) and ([551) yields 

|® s (s) n S\ < d 8(s - r) ^ ^ (59) 

where we used the first assumption in ([55]) (and that <fi ^ 1 ^ |5|). It follows from (159[) that 
\S \ ‘B s (x)\ ^ 3|5|/5. By combining (H5j) and (TF6l) with this lower bound on \S \ T> s (x)\ we see that 


min | -U-, |‘B s (x)| )• < max 


4 d i 6 (s- r ) Q/LLndlogd f i + 2r\S\ 




(flog n 


n 


(60) 


But, the two assumptions in (1551) imply that 3|S|/5 is greater than the right hand side of (j60|) . so 


64Lndlog d (, , 2r\S\\ \ m |5| 
IWK-, —, 01ogn (i + — j < T - 


Corollary 12. If the conditions in m are satisfied then 

(fs 




2 1 + 


diam(G,dG) 
2 r 


i + 


2r|S| 


□ 


(61) 


Proof. For every x G V G and y G V G \ ® s (x) we have ||+(a;) - F(y)\\w 1 (S,d Gr (.s)) > s 

4 W F ( x ) - F (y)Ww 1 (s,d GAS) ) > fr 2 \\ F ( x ) - F (y)\\w 1 (s,d GAS) ) 


. Hence, 


x,yeV G 


xeV G yeVa^Hslx) 


> 4 £ <" - I3.(x)l) H 1 - maXtetfc |3jMI ) f |■ ( 62 ) 


x&V G 


At the same time, by Lemma [8] we have 

4 2 ||f(x) - < 2 ’' r9 t^ a ,uf F F ) E 


x,yev a 


m L (l + diam f G) 


(2r + 1 )^|^g| 

XI d G r (S)(x,y) 

{x,y}GE G 


{x,y}&E G 


133 


L 1 + 


diam(G,dG!) 


2r 


i + 


2r|S| 




. (63) 
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The desired estimate (f6TT) follows by contrasting (1621) with (1631) . □ 

Theorem 13. Continuing with the same notation as above, if 0 < r ^ diam(G, do) then 

■ f 1^1 log^ 16r 2 logd + rlog(0|5|/8) ) 

~ 1 _|_ mm \ nd log d' diam(G, do) log d J 

Proof. If 16r logd+log(</>|5|/8) ^ 0 then ([Ml) is vacuous. Supposing that 16r log d+log(0|5|/8) > 0, 
choose s = r + (log(^|5|/8))/(161og d), so s > 0 and rf 16 ( s_r ) = ^151/8. The first inequality of (1551) 
is therefore satisfied, so either the second inequality in (1551) fails, corresponding to a lower bound 
on L , or by Corollary fl2l the lower bound on L in (1611) is satisfied. This simplifies to give (1641) . □ 


For a wide range of the parameters (i.e., n, d, 4>, diam(G, do), |<S|, r), Theorem fTHl yields a nontriv¬ 
ial lower bound on the Lipschitz constant of any mapping F that extends the 1-Lipschitz function 
/ given in (1421) . Rather than treating the general case, we shall now proceed to compute an optimal 
setting of the parameters in (l64l) when G has the property that diam (G,dc) x (log n)/log d and 
4>(G) x 1. Such graphs exist for any degree d ^ 3 and arbitrarily large n G N, and in fact this holds 
true with probability tending to 1 as n —>• oo when G is chosen uniformly at random from the finite 
set of all n-vertex d-regular graphs; for the diameter of random regular graphs see e.g. |BFdlV82] 
and for the expansion of random regular graphs see e.g. IHLW06) Theorem 4.16]. 

So, if ft x 1 and diam(G, do) x (log n )/ log d, and continuing with the assumption of Theorem fl3l 
that 0 < r ^ diam(G, da), the lower bound (1641) becomes 

1 . f |S| log n r (r logd + log |5|) 

~ l a. iM mm 1 nd log d ’ log n 

The optimal choice (up to constant factors) is to take S C Vg with |5| = [ny/d log d /\/log nj (this 
is allowed provided n is large enough so as to ensure that |5| ^ n, specifically it suffices to assume 
that n ^ d d ) and r x y/\ogn/y/dlogd. With these choices we see that L > yj\ogn/y/d log d. Thus, 

(OAS).W t (S,d Gr{s ,)) > -^g|. (65) 

For fixed d (say, d = 4), the estimate (1651) implies Theorem |TJ □ 

Remark 14. The above reasoning also provides a new example showing that ac(e) > 1/e. Recall¬ 
ing ([3ID, every distinct x,y G S satisfy dQ r (s){x,y) ^ ediam(S',d(j r (s)) where e x r/diam(G, do). 
When diam(G, da) x (logn)/log d and r x y'log n/y/dlogd, this becomes e x y/\ogd/y/dlogn. 
The lower bound L > ydog n / y/d log d therefore becomes L > l/{ed). 


4 . Proof of Theorem 0 ] 

In Section o we defined the parameters e a (X, Z) and e“(X, Z), for every a € (0,1], n € N and 
every two metric spaces ( X , dx) and (Z, dz )• Below it will be convenient to also use the analogous 
notation e a (X, S, Z) for S Cl, i.e., 

e a (X,S,Z) d ^ f e{(X,d%),S,(Z,d z )). 

Theorem 0] asserts the validity of the following four lower bounds. 


and 


e a {(. OQ ,i 1 f)>n 4 Q and e“(£oo, t 2 ) > (log n) ^ , 

/ log n 


„21 


e a (ioo^2) > 2 and e“(^oo,4) 


\ log log n 


“ 2 -3 


( 66 ) 

(67) 


18 
































The proof of (l66l) appears in Section I4~T1 below, and the proof of (1671) appears in Section I4~2l below. 


4.1. Twisted unions of hypercubes. For n £ N we identify the discrete hypercube {0, l} n with 
Fg, the vector space of dimension n over the field of size two F 2 . We let ei,... ,e n denote the 
standard basis of Flf and write e = ei + ... + e n . 


Lemma 15. Suppose that a € ( 5 ,1] and r,s € (0, 00 ) satisfy 

(2af a s{2r) 2a - 1 ^ (( 2 a) ^ - 1 1 . 

Define for every (x,i), (y,j) € F?; x F 2 , 

min < s||x — y||i, 2 r + ||x — y\\l a > if * = j = 1 , 


d((x,i), ( y,j )) '= < 


1 

I 2 a 


\x-y\\r if * = J = 0, 

r + min < s||x — y ||i, ||x — y\\l a > if i j. 


( 68 ) 


(69) 


Then d is a metric on W% x F 2 . 


Proof. Recall that if c 0 : [0, 00 ) —>• [0, 00 ) is concave, nondecreasing, and satisfies w(0) = 0, then 
u o dx is a metric on X for every metric space (X,dx). An application of this observation to the 
mapping t 1 —> t l ^ 2a \ which is concave because a > shows that d is a metric on FI? x {0}. Also, 
since both of the mappings t i—>■ st and t 1 —> 2r + t 1 /! 2 ") are concave and increasing, the mapping 
1 1 —)- minjsf, 2r + t 1 /! 2 ")} is concave and increasing. Consequently, d is a metric on Flf x {1}. 

It therefore remains to show that if i,j, k £ {0,1} satisfy {i,j, k} = {0,1} and x,y, z € F£ then 
d((x, i), ( y,j )) ^ d((x, i), ( z , k))+d((z, k), (y,j )). By translation and permutation of the coordinates 
we may assume that x = 0 and y = e 1 + ... + e a for some a £ {1,..., n}. In this case, note that if z 
has a nonzero entry outside {1,..., a} then both ||z||i and ||y — z ||i will decrease if we change that 
entry of 2 from 1 to 0. By the definition of d in (l69|) . this shows that d((x, i), ( z , k)) + d((z, k), (y, j )) 
will decrease if we set all the entries of z that are outside {1,... ,a} to be equal 0. It therefore 
suffices to prove that for every a £ {1,..., n} and b £ {1,... , a — 1} we have 

d((0,i), (ei + .. - + e a ,j)) ^ d((0,i), (ei + .. , + e bl k)) + d((e 1 + .. , + e b ,k), (ei + .. - + e a ,j)). (70) 
We shall establish the validity of (170)1 through the following case analysis. 


Case 1: i = j = 0. In this case necessarily k = 


a 2 a ^ 2 r + min 


in 62 a | 


1, so by (1691) the desired inequality (fTOl) becomes 
+ min |s(a — 6 ), (a — 6 ) 2 ^|. (71) 


The mapping lo : [0, 00 ) — > [0, 00 ) given by co(t) = minjst, t 1 / ( 2o: ) } is concave, increasing and 
satisfies w(0) = 0. Consequently, u(b)+u(a—b) uj(a). So, in order to prove (17TT) it suffices to show 
that a 1 ^ 2 ") ^ 2r + w(o) = 2r + min{sa, a 1 ^ 2 ^}. We therefore need to show that a l ^ 2 °^ ^ 2r + sa. 
Define cf : [0, 00 ) —>• [0, 00 ) by f>(t) = 2r + st — t 1 ^ 2 "). The minimum of (j) on [0, 00 ) is attained at 
t m in = 1 / (2as) 2oi ^ 2a ~ 1 h Hence, by our assumption (1681) we have 


(f{a) 0(tmin) 


2 r + 


s 


2a 

(2US) 2a — 1 


(2 as) 


= 2 r — 


(2«)2a-l _ 1 

((2a) 2 “s) 2a — ! 


m 
^ 0. 


Case 2: i = j = 1. In this case necessarily k 


mm 


in jsa, 2r + a 2 a | 


2r + a 2 a 1 ^ 2 r + min 


= 0, so by (1691) the desired inequality (1701) becomes 
|s 6 , b^ | + min |s(a — 6 ), (a — 6 ) 2 ^ j . (72) 
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As in Case 1, by the subadditivity of u we see that in order to prove (1721) it suffices to show that 
min{sa, 2r + a 1 ^ 2 ")} ^ 2r + oj(a) = 2r + minjsa, a 1 ^ 20 ^}, which is immediate. 

Case 3: {i,j} = {0,1}. In this case, by interchanging the roles of b and a — b, if necessary, in order 
to prove the desired inequality (l70l) one must show that the following two inequalities hold true 


+ min |sa, a2c j ^ b 2 <* + r + min js(a — b), (a — b ) 2a | , 


and 


r + min |sa, a 2a | ^ r + min jsfr, & 2 <* j + min |s(a — 6), 2r + (a — b) 2a | 
Thus, we need to check that 

min • 

and 


lin jsa, a 2 =* j ^ b 2 <* + min js(a — b), (a — b) 2 °< | 


min |sa, a 2 “ | ^ min | sb, b^a | + min |s(a — b),2r + (a — b) 2 <* | . 
Inequality (|74l) follows immediately from the subadditivity of uj as follows. 


(73) 

(74) 


mm 


in |s6, & 2 a | + min js(a — b ), 2 r + (a — b) 2 « j ^ ui(b) + w(a — b) ^ c v(a) = min |sa, a 2 ^ j . 


Since the mapping 1 1 ->- t 1 ^ 2a ' ) is subadditive, we have 6 1 /l 2a ) + (a — b) l ^ 2 °^ ^ a l ^ 2a \ Therefore, in 
order to prove (|73f) it suffices to show that b 1 ^ 2 ^ + s(a — b) ^ minjsa, a 1 ^ 2 ")}. Define if : [0, a] —>• K 
by lift) = tV( 2 “)+s (a—t). Then if is concave. It follows that the minimum of if on the interval [0, a] 
is attained at one of its endpoints, and therefore if(b) ^ min{^/>(0), ip (a)} = minjso, a 1 ^ 2 "^}. □ 


The proof of the following theorem is a variant of an argument of [JLS86] . 


Theorem 16. Continuing with the notation of Lemma [/5| and assuming from now on that the 
condition (l68l) is satisfied , define f : FJf x {0} —>• by setting f(x, 0) = x for every x € F£. Then 
|| f(x, 0) — f(y, 0)||2 = d((x, 0), (y, 0)) a for every x, y € Fg. At the same time, if F : Flf x F 2 —>• Pf 
extends f and satisfies ||.F(u) — F(u)|| 2 ^ Ld(u,v) a for some L € (0,oo) and every u, v € Flf x F 2 
then we necessarily have 


L> 


n 


2 r c 


(75) 


Proof. Note that for every x,y £ Fj we have 

||/(®,0) - f(y, 0)|| 2 = ||x-y|| 2 = \/\\x — y\\i = d((x, 0), (y, 0))“, 

by the definition of d in (1691) . Next, the assumptions on F imply that for every x, y 6 FJ and every 
j € {1,..., n} we have 

|| F(x + ej , 1) - F(x, 1)|| 2 ^ Ld((x + ej, 1), (x, 1))" ® Lmin{s a , (2r + 1)“} ^ Ls a . 

Hence, 

n 

E E W p ( x + e F !) - p ( x ’ !)II 2 < n2 n L 2 s 2a . (76) 

j= 1 X &2 

Also, the following estimate holds true for every x € F^. 

\\F(x + e, 1) — F{x, 1)|| 2 

^ ||F(x + e, 0) - F(x, 0)|| 2 - ||F(x + e, 1) - F(x + e,0)|| 2 - \\F(x, 1) - F(x,0)|| 2 
^ II f{x + e, 0) - f(x, 0)|| 2 - Ld((x + e, 1), (x + e, 0)) a - Ld((x, 1), (x, 0))“ 

= y/n- 2Lr a . 
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Hence, 


(77) 


|| F(x + e, 1) — F(x, 1)||| ^ 2 n (max {y/n — 2Lr a , 0}) 2 . 

xe¥% 

By a classical inequality of Enflo [Enf69| (see also [Mat02l § 15.4]), 

n 

E \\F(x + e,l)-F(x,l)\\ 2 2 ^Yl E ll^ + e i ,l)-E(x,l)|||. ( 78 ) 

iSFj j=1 sSFj 

A substitution of (1761) and (1771) into (1751) now yields the estimate 

yfn — 2Lr a ^ Ls a yfn =>■ L ^ -. □ 

s a - v /n + 2r ' a 

One should clearly choose those r, s € (0, oo) that satisfy the constraint ()68|) and maximize the 
right hand side of (1681) . This yields better dependence (in terms of an a-dependent constant factor 
but not in term of dependence on n ) than the following sub-optimal choices. 

def —ky , def — 2a ~f 

r = n^ 2 and s = n 4c 2 . (79) 

It is elementary to check that under these choices the constraint (|68l) is satisfied, yielding the 
estimate L > n( 2a ~ 1 ' 1 /l 4 ") in Theorem H6l This implies the validity of (1661) . i.e., the first half of 
Theorem U Namely, the following bounds hold true. 

Corollary 17. Continuing with the notation of Lemma 11.51 we have 

e“(4o,^) > e“((F2 x F-,d),F" x {0},^) > n^r. 

Also, since n x log |F?>|, it follows that for arbitrarily large N £ N we have 

2a — 1 

e%(locj 2 ) > (log N)~^~. 

Remark 18. Consider the mappings uq,uji : [0, oo) —>• [0, oo) given by 

Vt^O, wo(t) = and wi(t) = min{st, 2r + wo(t)}- 

Both wo and uq are concave, increasing and vanish at the origin. The metric d given in (f69l) satisfies 
d((x,0),d(y,0)) = wo(||x — y||i) and d((x, 1), (y, 1)) = wi(||a; — y||i). As explained in Remark 5.5 
of jMN15| . it follows that both of the metric spaces (Fg x {0},d) and (F£ x {l},d) embed into t\ 
with 0(1) distortion. We do not know whether the metric space (Fg x F 2 ,d) admits an embedding 
into l\ with 0(1) distortion. If it were the case that any embedding of (Fg XF 2 , d) (say, when a = 1) 
into i\ must incur bi-Lipschitz distortion that tends to 00 as n —> 00 , this would be the first known 
example of a metric space that can be partitioned into two subsets, each of which well-embeds into 
yet the entire space does not. For more on such questions, see |MN13b| . 

Remark 19. The metric d in (f69l) . as well as the magnification of a metric space that was described 
in Section 13.11 are both special cases of a generalization of a construction that was used in |JLS86j 
for the purpose of proving a Lipschitz-nonextendability result. Variants of this construction were 
also used in jLan991 ICKR05I . The general idea is the following procedure to “glue” two metric 
spaces. Suppose that (X,dx) and ( Y,dy) are finite metric spaces with X and Y disjoint as sets. 
Suppose also that we are given a mapping a : X —> Y and r € (0,oo). Define a weighted graph 
structure on A U Y as follows. If x\,xi € X then join x\ and X 2 by an edge of weight dx(x 1 , 2 : 2 ). 
Similarly, if yi,y 2 G Y then join y\ and y 2 by an edge of weight dy(yi,z/ 2 )- Also, for every x € X 
join x and o(x) by an edge of weight r (a further generalization of this procedure could allow 
the weight of the edge {x, u(x)} to depend on x). Consider now the shortest-path metric that this 
weighted graph induces on XUY. One can check that the above metric spaces can also be described 
as subsets of this general construction, whose usefulness is probably yet to be fully exploited. 
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4.2. Linearization. Our goal here is to prove (f67l) . thus completing the proof of Theorem 01 The 
argument below is based on the linearization procedure of [JL84j . with several modifications that 
yield quantitative improvements and, more importantly, allow us to to treat Holder mappings. 

In what follows, given a Banach space (IT, || • ||w) we denote 

B\y = {x £ W : ||x||vv ^ 1} and Sw = dB\y = {x £ W : ||x||w = 1}- 

If V C IT is a linear subspace then we will always consider it as being equipped with the norm 
inherited from IT, thus slightly abusing notation by denoting the sets V n B\y and V n Sw by By 
and Sy, respectively. A mapping h : W\ —>• IT 2 between two real vector spaces ITi,IT 2 is said to 
be positively homogeneous if h(Xx) = A h(x) for every A £ [0,oo). 

Lemma [20] below is a variant of Lemma 5 in |JL84j , with the difference being that in |JL84j the 
conclusion corresponding to (18111 is weaker in the sense that the factor y/rnD is replaced by m. The 
difference between our proof and the proof in |JL84j is that we perform averaging in the image of 
T rather than in X itself, allowing for a more refined estimate that we shall use later. 

Lemma 20. Fixm £ N and D £ [l,oo). Let (Y, || • ||y) and (Z, || • || z) be Banach spaces and ACT 
an m-dimensional linear subspace ofY. Suppose that T : X —> Lffi is a linear operator that satisfies 

Vx€A, ||x||y ^ ||Ta;||2 ^ -D||a;||y. (80) 

Suppose also that we are given a positively homogeneous Lipschitz mapping if : X —» Z and a linear 
operator U : X —>• Z. Then there exists a positively homogeneous mapping T : T —»■ Z that satisfies 

H^llLip < HV'llLip and \\^\x-U\\u P <VrnD sup \\'fi(z)-Uz\\ z . (81) 

zeS x 


Proof. Write B™ = Bpn and 


v m = vol (B™) = 


7T 2 


r (i + f) 


Define d> : Y —>■ Z as follows. 


Vy £ Y, 1 j>(y) c = — f if (y + T 1 rc) dw = — f if (T 1 u») dw. 

v m Jb™ v m JTy+B™ 

Then, because cf is obtained by applying an averaging operator to if, we have 

H\\up < 11^11 Lip- 

Also, because ^>(0) = 0 (recall that if is positively homogeneous), we have 


sup 

yeS Y 


itsol) 

z < IIV'llLip sup ||y + T _1 u;||y < 2||^||Li P - 

(y,w)eS Y xB™ 


Similarly, 

sup \\(f(y) - Uy\\z = sup — [ (if — U)(y + T~ l w) dw 
y&Sx y&Sx v m JB™ 


2 

-T—1. 


(82) 


(83) 


(84) 


y&S x y&S x 

^ sup \\(if - U)(y + T~ l w)\\ z ^ sup \\if(z) - Uz\\ z = 2 sup \\if(z) - Uz\\ z , (85) 

(y,w)£S x xB? zG2B x zGS x 


where the penultimate step in (185D holds true because, due to (1501) . we have T 1 i? 2 1 C Bx, so 
Sx + T~ 1 B™ C 2 Bx- The final step of ([SOD holds true because if — U is positively homogeneous. 
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Fix distinct x, y G Sx and write 2r — ||x — y\\ y. Then, since U and T are linear, 


(' i/j — U)(T l (w))dw -— [ 

) v n J,r 


(Ty+B™)\(Tx+B%) 


^-U){x)-{ct>-U)(y) 

v m J(Tx+B™)^(Ty+B™) 

Consequently, 

- u )( x ) ~(4>~ U){y)\\ z ^ — f \\(tp- U){T~ x (w))\\ z dw 

v m J(Tx+B^)A(Ty+B^) 

vol {{Tx + B™)A(Ty + B™)) 


(i/j — U)(T l (w))dw. 


< 


sup 


z)-Uz\\ z . (86) 


We note that 

vol ((Tx + B ”*) U (Ty + Bf)) ^ v m + ||Tx - Ty\\ 2 v m . i. (87) 

Indeed, by rotation invariance, it suffices to verify (1871) when Tx = re i and Ty = —re i, where 
ei,... ,e m is the standard basis of M m and we recall that || Tx — Ty\\ 2 = 2r. Identifying M m with 
R x R m_1 , the estimate (1571) is a consequence of the fact that (rei + B™) U (—rei + B™) is contained 
in the union of the three sets ([r, oo) x R m_1 ) n (re i + B ™), ((—oo, — r] x R m_1 ) D (—re i + B™) and 
[—r, r] x B™^ 1 , whose interiors are disjoint. 

Now, due to (|57j) we have 

vol ((Tx + B™)A(Ty + B?)) = 2vol ((Tx + B™) U (Ty + B?)) - vol (Tx + B%) - vol (Ty + B%) 

< 2||Tx - Ty\\ 2 v m -i- 


So, the first term in the right hand side of 
vol ((Tx + B%)A(Ty + B™)) 


can be bounded as follows. 

2v i ® 

^—-—\\Tx — Ty\\ 2 ^ y/mD\\x - y\\ Y 


Vn Vjji 

where we used the fact that 2v m -i/v m ^ \/m, which follows from 
addition, we actually have linim^oo 2u m _i/u m = \J2/-k). 

To bound the second term in the right hand side of 

?2 


( 88 ) 

and Stirling’s formula (in 


Therefore (x + T l B™)A(y + T l B™) C 2 Bx, since x,y € Sx- So 


sup 

zG{x+T~ 1 B^ l )A{y+T- 1 B^ 1 ) 


z)-Uz\\z<: sup 

z£2Bx 


, note that T l B™ C Bx, by 

z) ~ Uz \\ z , (89) 


z) — Uz\\z = 2 sup 
zGSx 


where we used the fact that i/j — U is positively homogeneous. 
A substitution of (1551) and (1891) into (1861) shows that 


- U)\s x \\up < 4 sfmD sup 
26 S x 


z) - Uz\\ z ■ 


(90) 


Define T : Y -A Z to be the positively homogeneous extension of (j)\s Y , i-e., \H(y) = \\y\\Y(t>(y/\\y\W) 
for every y £ Y \ {0} and 'L(O) = 0. Then a simple computation (that is carried out in detail 
in [.IL841 Lemma 2]) shows that 

(pli Aifsa 

ll^llLip ^ 2||^|5y|| Li p + sup W<t>(y)\\z 4||^||Lip- 

y£S Y 

For the same reason, since T — U is the positively homogeneous extension of (cj> — U)\s x to X, 

(fS5t A(IWH) 

\\^\x-U\\ U p 2\\((j)-U)\s x \\up+ sup \\4>(y)-Uy\\ z ^ 10^/mD sup \\i/)(z) - Uz\\ z . □ 

yGSx 26 Sx 
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We shall use below the following lemma of Begun |Beg99| ; alternatively one could use below an 
argument of Bourgain [Bou87j that Begun significantly simplified in |Beg99| . 

Lemma 21. Fix S,L,t € (0, oo) and m € N. Suppose that (Y, || ■ ||y) is an m-dimensional normed 
space and that (Z, || • \\z) is a Banach space. Let K C Y be a convex set and suppose that we are 
given a mapping F : K + rBy —>• Z that satisfies 

Mx,y € K + tB y, \\F(x) - F(y)\\ z ^ L(\\x - y\\y + 5). 


Define h : K —>• Z hy 


VxeK, 


r / \ def 

h[x) = 


/ F(x + y)dy, 
J tBv 


vol (tB y ) J tBy 

where the integration and vol(-) are interpreted through an identification ofY with M m . Then 


(91) 


IWlLip « L (i + 


Corollary 22. Fix t € (0, oo), a € (0, 1] and m € N. Suppose that (Y , || • ||y) is an m-dimensional 
normed space and that (Z, || • \\z) is a Banach space. Let K CY be a convex set and suppose that 
F : K + rBy —>• Z is a-Holder. Then there exists h : K —)• Z that satisfies 


sup | 
xgk 


(*) - F (x)\\z < r Q ||F|| Lip(Q) and 


.. a 2 (m/r) 1 a + 1 

11 ^ 11 Lip ^ --- ll^llLip(a)- 


a 


Proof. Denote L = ||Y|| Lip ( a \. Note that for every 5 € (0,oo) and every x,y € K + tB y we have 


||F(x) - F(y)\\z < L\\x - y\\y < L(\\x - y\\ z + 6) sup —— 

t>0 t + 0 


(1 — a) 1 a a a L 
5 ^ 


\x-y\\z + d). 


We may therefore apply Lemma eh deducing that for h defined as in (IM1) we have 

Sm\ 


n > ii (1 — a) 1 a a a L 

IHlLip < -—^- 


1 + 


2 r J 


(92) 


The value of 5 that minimizes the right hand side of (19211 is <5 = 2r(l — a)/(am), which gives 


Lip ^ (1 Ol) 


l ~ a a a 


a 


1—a 


( m \ 1 — a 


(1 — a) 1 a \2 t) 
It remains to note that for every x € K we have 

1 i 


+ 


a 


a 2 (m/r) 1 “ + 1 

lJ ^ - 1 j . 


a 


\\h(x)-F(x)\\ z = 


vol (tB y ) J tBy 


(F(x+y)-F(x))dy 


< 


vol (tBy) J tBy 


L\\y\\$dy ^ Lr a . □ 


Theorem 23. There is a universal constant k € (0,1) with the following properties. Fix a € (0,1], 
M,m € N and C,D e [l,oo). Suppose that (Y, || • ||y) is an M-dimensional normed space and 
that (Z, || • || z) is an m-dimensional normed space. Let X CF be an m-dimensional subspace ofY 
whose Banach-Mazur distance to Pfi is at most D, i.e., there exists a linear operator T : X —>• 1™ 
satisfying (1501) . Suppose also that U : X —>• Z is a linear operator with 

\/x € X, IM|y < \\Ux\\ z ^ CHxlly. (93) 

For every e € (0,1) let X £ be an e-net in Sx- Then, under the additional assumption that 

K 

y/mCDe°(Y. N e . Z) 
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there exists a linear operator V : Y 

rl—Oi 


Z that extends U and whose operator norm satisfies 


Y^z < aM 1 ~ a m 1 ^C^D 1 -^e a (Y,^ e ,Z)^ + -e a (Y,N £ ,Z). 

a 


( 95 ) 


Proof. If x, y € Bx then ||x — y\\y ^ 2, hence \\Ux — Uy\\z ^ C\\x — y\\y ^ 2 l ~ a C\\x — y\\y- This 
implies that the restriction of U to N e is a-Holder with constant 2 C. Denoting L = e“(Y,lSb, Z), 
it follows that there exists g :Y —>• Z that extends U\x s and is a-Holder with constant 2 CL. 

Let p : Z —>• CBz be the radial retraction of Z onto CBz, i.e., p(z) = z if \\z\\z ^ C and 
p(z) = Cz/\\z\\z if \\z\\z > C. It is straightforward to check that p is 2-Lipschitz. Therefore, if we 
define F = po g then ||T|| LiD( - Q ,' ) ^ 2||o|| LiD( - Q ,' ) ^ 4 CL and ||F(y)||z ^ C for every wGh. Moreover, 
for every a e we have K)||z = wX < C, so F(a) = p(g(a)) = g(a). Thus F : Y ^ CB Z 
is also an extension of U\a- Fix T £ (0,1) that will be determined later. By Corollary [22] (applied 
with K = Y) there exists h :Y Z that satisfies 


and 


Then 


sup 

yeS Y 


VyeY \\h(y) — F(y)\\ z ^ 4CLt c 

o?(M /T 1 

i|fe||Lip^4 al ,T) - —CL. 

a 


z < sup \\F(y)\\ z + 4CLT a ^C + 4 CLt° 
y&S Y 


(96) 

(97) 

(98) 


where in the last step of ([981) we used the fact that F takes values in CB Z - 

Denote by ip : Y —> Z the positively homogeneous extension of h\s Y to Y. By [JL84L Lemma 2], 


I Lip ^ 2||/i|sy || L i P + sup 
yeS Y 


z < 4 y ’ - ——CL + C + 4:CLr a . (99) 


a 


Take x 6 Sx (thus fi(x) = h(x)) and a € with ||a — x||y ^ e. Then h(a) = F(a) = Ua. Hence, 


\\ifx) - Ux\\ z < || h(x) - F(x)\\ z + || F(x) - F(a)\\ z + \\Ua - Ux\\ z 

^ ACLr a + 4CL\\x - a||£ + C\\x - a||y ^ 4CLr“ + 4 CLe a + Ce^ 5 CL(t° + e a ). (100) 


Due to (]99l) and (11001) . Lemma 1201 implies that there exists T : Y —>• Z that satisfies 


and 


I'Ll 


Lip 


< 


a 2 (M/r) 1 a + 1 


CL + C + CLr a < 


a 2 (M/r) 1 a + 1 


a 


a 


CL, 


*\x ~ U || Lip < y/rnCDL{r a + e a ). 


By Proposition 1 in |,TL84j . which relies on an important linearization result due to Linden- 
strauss [Lin64j . there exists a linear operator S : Y —>• Z such that 

a 2 (M /t)^ —Q T 1 

WSWy^z < l^llLip < -^-U. - —CL, (101) 

a 

and 

||S|x - U\\x^z ^ ||^|x - U\\ Up < V^CDL(r a + e a ). (102) 

Our assumption (1M1) says that y/mCDLs a ^ k. So, if we choose r = (/c/( y/mCDL )) 1 ^ Q and k > 0 
is a small enough universal constant then it would follow from (11011) and (11021) that 


||S||y_>z < aM l ~ a m^C«D—L« + 


CL 


a 


and \\S\x ~ U || 


a:-» z 




(103) 
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Letting Iz '■ Z —>• Z denote the identity mapping on Z, we have 

(l93llA (ll03t 1 

ll^Wtr 1 - Iz\\ z ^ z = ||(5|x - m-lz^z < II S\x - U\\ x ^z\\U- l \\z^x < 2- 

Consequently, the operator (S|x)C _1 : Y —>• Y is invertible and \\((S\x)U~ 1 )~ 1 \\z^z ^ 2. Define 


Y= f ((5: Y -> Y. 


Then V extends S and 

|y-s-z ^ \\((S\x)U~ 1 )~ 1 \\ z ^ z ||5||y_>.z < 2||5||y_).z < aM 1_a ra^"C»h“L« + 


frost 


CL 

ct 


□ 


Given a normed space (Y, || • ||y) and a linear subspace X C Y, the projection constant of X 
relative to Y is denoted A (X,Y). Thus A (X,Y) is the infmium over those A € [l,oo] for which 
there exists a linear projection P from Y onto X with ||-P||y_»jr Y A. 


Corollary 24. Let k be the universal constant from Theorem \2tA Fix a € (0,1], M, m € N and 
D e [1, oo). Let X C Y be an m-dimensional subspace of Y whose Banach-Mazur distance to IVf 
is at most D. For every e € (0,1) let 3ST e be an e-net in Sx ■ Then, 


e € 




e Q (Y,X £ ,q 


> 


A (X,Y) C 


Af Q ( 1_Q ) D 2 ~ a 


y/fFD\(X, Y) 

In particular, there exists n € N satisfying log n x m (A + log m) for which 

\{X, Y) a 


min {e a (Y, Iff), e“(Y, Off)} 


> _ 

rsJ 1 — Ot /-t \ c\ 

m~^~ M"! 1 "®) D 2 ~ a 


( 104 ) 


( 105 ) 


Proof. Since the Banach-Mazur distance of X to Fff is at most D, the lower bound on e a (Y, . Pff ) 

that appears in (|104H will follow if we show that 


e a (Y,X e ,X) > 

We may assume from now on that 


A(Y,Y) c 


r*j 1 — a / -t \ i 

m~ M a ^-- a ) D l ~ a 


( 106 ) 


e^Y,^,*)^ — 


A(X,Y) c 


- n , ( 107 ) 

m—M a ^- a )D^~ a 

since otherwise there is nothing to prove. By the assumption on e in (11041) combined with (11071) , 


e C 


A7M 1 -" 


< 


v mDX(X,Y) \y/IEDe a (Y,X e ,Z) 


We can therefore apply Theorem [23] with Z = X and U = I\ being the identity on X (in particular, 
C = 1). The operator V : Y —>• X thus obtained is a projection onto X, since V\x = Lx- Hence, 


A (X,Y) ^ \\V\\ Y ^x 


< 


aM 1 - a m^D l ^ L e a (Y,X £ ,X)^ + - 


l (Y,X £ ,X) 


a 


< 

r^j 


M 1 a m 2c D a e a 


(Y,X e ,X)~ 


a 


which simplifies to give the desired estimate (I106D . 

To deduce (11051) . note that by the Kadec'-Snobar theorem jKS7lj we have A (AT, Y) ^ y/m and 
by John’s theorem [ Joh48j we have D ^ y/rn. Therefore, if we choose e = K 1//Q /m 3 / 2 then the 
upper bound on e that appears in (|104l) holds true. It remains to note that if we set n = |3M e | then 
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VLUi\{e a (X, ^ 2 l ),e“(X, £™)} ^ e a (Y,'N e ,X), and by standard estimates (e.g. [MS86j i on the size of 
e-nets in m-dimensional normed spaces we have logn x mlog(l/e) x m(logm + 1/a). □ 


The following corollary implies (1671) , thus completing the proof of Theorem [4) 
Corollary 25. For arbitrarily large m,n € N and every a € (1/2,1] we have 

/ logn 


e a (h ,F?)>m° 


and e“(^i,^ 2 ) 


\ log log n 


a2 ~h 


Proof. By [FLM771 IKas77j . for every m € N there exists an integer M = 0{m ) and an Tri¬ 
dimensional subspace X of if 1 whose Banach-Mazur distance to £f‘ is 0(1). By [Rut65| , we 
have \(X,£?) > y/rrt. Therefore, an application of Corollary [Ml with D = 0(1), and M = 0(m) 
shows that there exists n € N with log n x m log m such that 


mm 


{e“(^^),e“(£f,^)}> — 


771 2 


m 2 


+a(l—a) 


a 2 — — 

= m 2 


/ logn 
Vlog log 


n 


□ 
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